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Following  de  Broglie’s  idea  of  a  pilot  wave,  this  paper  treats  quan¬ 
tum  mechanics  as  a  problem  of  stochastic  optimal  guidance  law 
design.  The  guidance  scenario  considered  in  the  quantum  world  is 
that  an  electron  is  the  flight  vehicle  to  be  guided  and  its  accompa¬ 
nying  pilot  wave  is  the  guidance  law  to  be  designed  so  as  to  guide 
the  electron  to  a  random  target  driven  by  the  Wiener  process,  while 
minimizing  a  cost-to-go  function.  After  solving  the  stochastic  opti¬ 
mal  guidance  problem  by  differential  dynamic  programming,  we 
point  out  that  the  optimal  pilot  wave  guiding  the  particle’s  mo¬ 
tion  is  just  the  wavefunction  (t ,  x),  a  solution  to  the  Schrodinger 
equation;  meanwhile,  the  closed-loop  guidance  system  forms  a 
complex  state-space  dynamics  for  < P(t,x),  from  which  quantum 
operators  emerge  naturally.  Quantum  trajectories  under  the  action 
of  the  optimal  guidance  law  are  solved  and  their  statistical  distri¬ 
bution  is  shown  to  coincide  with  the  prediction  of  the  probability 
density  function  9* 
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1.  Introduction 


The  subject  of  stochastic  optimal  guidance  in  aerospace  is  concerned  with  determining  control 
policies  which  optimize  some  probabilistic  measure  of  performance  for  flight  vehicles  [1,2],  From  the 
viewpoint  of  optimal  guidance  law  design,  this  paper  aims  to  reveal  an  interesting  analogy  between 
guided  flight  motion  and  guided  quantum  motion  by  showing  that  both  can  be  described  by  the  same 
pursuit-evasion  game.  The  idea  of  a  “pilot  wave”  that  guides  the  movement  of  an  electron  was  first 
suggested  by  de  Broglie  at  the  1927  Solvay  Congress.  According  to  de  Broglie’s  idea,  electrons  have 
a  dual  particle-wave  nature.  Accompanying  every  electron  is  a  wave,  which  guides,  or  “pilots”,  the 
electron  through  space.  After  de  Broglie’s  initiation,  a  substantial  development  of  the  pilot  wave  began 
with  Bohm,  who  pointed  out  that  the  action  of  a  pilot  wave  is  like  that  of  radio  wave  guiding  a  ship. 
With  the  observation  that  the  ship  is  moving  with  its  own  energy,  and  that  the  form  of  the  radio  waves 
is  taken  up  to  direct  the  much  greater  energy  of  the  ship,  Bohm  proposed  that  an  electron  moves 
under  its  own  energy,  and  that  the  pilot  wave  guides  the  energy  of  the  electron.  There  are  three  main 
postulates  in  Bohm’s  proposal  [3]: 

(1)  An  individual  physical  system  comprises  a  wave  propagating  in  space  and  time  together  with  a 
point  particle  which  moves  continuously  under  the  guidance  of  the  wave. 

(2)  The  wave  is  mathematically  described  by  9(t,  x),  a  solution  to  the  Schrodinger  equation. 

(3)  The  particle  motion  is  obtained  as  the  solution  x(t)  to  the  equation 

x  =  (l/m)VSB(x,  t),  (1.1) 

where  SB  is  the  phase  of  =  RB(t,  x)eiSfi(t’x)/\  RB,  SB  e  R. 

Eq.  (1.1)  was  known  as  the  guidance  condition  of  quantum  motion.  It  was  recognized  [4]  later  that 
Eq.  (1.1)  only  represents  some  regular  velocity  v  =  VSB/m,  while  the  actual  velocity  v  =  v  +  f  (t)  has 
an  additional  random  component  £(t).  An  explicit  characterization  of  the  random  velocity  |(t)  was 
derived  by  Bohm  and  Hiley  [5]: 


v  =  v  +  |(t)  = 


VSB 

m 


VRB 


(1.2) 


where  v  is  a  diffusion  coefficient.  This  random  velocity  |  (t)  is  actually  the  osmotic  velocity  originating 
from  the  Brownian  motion.  The  similarities  between  quantum  mechanics  and  Brownian  motion  were 
first  noted  by  Furth  [6]  and  developed  by  Kershaw  [7],  Nelson  [8],  Comisar  [9]  and  Rosenstein  [10]. 
Nelson  [8]  proposed  a  Brownian  model  for  quantum  motion: 


dx  =  b(t,  x)dt  +  dw, 


(1.3) 


which  comprises  a  drift  component  b(t,  x)dt  and  a  diffusion  component  described  by  the  Wiener 
process  dw  with  variance  E(dwtdwj )  =  2vSijdt.  It  can  be  shown  that  Nelson’s  drift  velocity  b(t,  x)  is 
identical  to  the  velocity  v  given  by  Eq.  (1.2).  Nelson  gave  a  stochastic  interpretation  of  the  Schrodinger 
equation  by  showing  that  the  classical  equations  of  motion  for  a  particle  undergoing  a  Brownian 
motion  with  drift  velocity  b(t,  x)  given  by  Eq.  (1.2)  lead  naturally  to  the  Schrodinger  equation,  if  the 
diffusion  coefficient  v  is  chosen  as  v  =  h/2m. 

On  the  other  hand,  Comisar  [9]  connected  Brownian  motion  to  the  path  integral  and  represented 
the  wavefunction  as  a  sum  of  path  integrals  over  Brownian-motion  trajectories.  By  the  analogy 
between  the  free-particle  propagator  and  the  Brownian-motion  Green  function,  Furth  [6]  and 
Comisar  [9]  pointed  out  that  the  path-integral  approach  and  the  Brownian-motion  approach  to 
quantum  mechanics  are  equivalent,  if  the  diffusion  coefficient  v  is  chosen  as  v  =  ih/2m,  which  has 
an  additional  imaginary  factor  i  =  V— 1,  as  compared  to  v  =  h/2m  suggested  by  Bohm  and  Nelson. 
Based  on  this  imaginary  diffusion  coefficient,  Comisar  [9]  proposed  a  complex  Langevin  equation  to 
describe  quantum  velocity 


dv  =  -fivdt  +  z1/2dw.  (1.4) 

Due  to  the  presence  of  the  complex  coefficient  0/2  =  (—  1  +  i) /V2,  which  also  appears  in  our  later 
derivation,  Comisar  concluded  that  the  randomizing  process  in  quantum  mechanics  involves  fictitious 
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collisions  described  by  trajectories  in  a  phase  space  with  complex  velocity  v.  It  can  be  said  that  the  op¬ 
timal  quantum  guidance  law  addressed  in  this  paper  inherits  the  idea  of  guiding  wave  from  Bohm  [3], 
and  adopts  the  randomizing  process  proposed  by  Nelson  [8],  while  confirming  the  complexifying  pro¬ 
cess  proposed  by  Comisar  [9], 

Inspired  by  the  fact  that  classical  mechanics  can  be  derived  by  the  principle  of  least  action  [  1 1  ]  and 
hence  by  the  optimal  control  theory  [2],  we  aim  to  show  that  the  de  Broglie-Bohm  guided-wave  ap¬ 
proach  to  quantum  mechanics  can  be  recast  into  an  optimal  guidance  law  design  problem.  The  equiv¬ 
alent  guidance  scenario  taking  place  in  the  quantum  world  is  that  an  electron  is  the  flight  vehicle  to  be 
guided  and  its  accompanying  pilot  wave  is  the  optimal  guidance  law  to  be  designed  so  as  to  guide  the 
electron  to  a  random  target  driven  by  a  Wiener  process,  while  minimizing  a  prescribed  cost  function. 

An  optimal  guidance  problem  can  be  solved  either  by  the  variational  approach  or  by  the  dynamic 
programming  approach,  but  only  the  latter  is  applicable  to  both  deterministic  and  stochastic  systems. 
Although  the  variational  approach  had  been  employed  to  derive  the  deterministic  wave  equation  by 
Schrodinger  himself  [  12],  it  is  unable  to  provide  us  with  the  stochastic  differential  equations  describ¬ 
ing  the  particle’s  quantum  motion  accompanying  the  wave.  For  the  latter  purpose,  the  dynamic  pro¬ 
gramming  approach  to  optimal  control  becomes  the  only  workable  method.  Here  we  will  show  that 
the  stochastic  Hamilton-Jacobi-Bellman  (HJB)  equation  arising  from  the  dynamic  programming  ap¬ 
proach  [13]  to  optimal  stochastic  control  is  just  the  Schrodinger  equation,  whose  solution  <A(t,  x)  de¬ 
termines  the  optimal  pilot  wave  guiding  a  particle’s  quantum  motion.  Our  result  gives  a  formal  proof  of 
the  role  of  a  wavefunction  9  (t,  x)  as  a  guiding  wave  and  provides  necessary  corrections  to  the  Bohm’s 
postulates  listed  above.  On  the  other  hand,  the  obtained  closed-loop  dynamics  under  the  action  of  the 
optimal  guidance  laws  is  shown  to  yield  a  complex  dynamical  representation  of  the  wavefunction 
'I' (t,x),  which  reveals  the  internal  mechanism  underlying  the  externally  observed  probability  den¬ 
sity  function 

The  remaining  sections  are  organized  as  follows.  Section  2  introduces  a  unified  optimal  control  for¬ 
mulation  of  classical  and  quantum  mechanics  by  the  dynamic  programming  approach.  In  Section  3, 
we  demonstrate  that  the  quantum  motion  of  a  particle  is  guided  by  its  accompanying  pilot  wave  in  an 
optimal  manner  that  minimizes  the  Lagrangian  cost  function  in  the  presence  of  a  stochastic  Wiener 
process.  The  optimal  pilot  wave  is  found  to  be  a  wavefunction  tP(t,  x)  solved  from  the  Schrodinger 
equation.  In  Section  4,  we  show  that  quantum  Hamilton  mechanics  [  14]  proposed  in  the  literature  can 
be  derived  from  the  stochastic  optimal  control  approach,  just  as  classical  Hamilton  mechanics  can  be 
derived  from  the  deterministic  optimal  control  approach.  The  optimal  quantum  trajectories  in  the 
presence  of  Gaussian  white  noise  are  computed  in  Section  5,  where  we  show  that  the  deviation  of  the 
quantum  trajectories  from  their  mean  trajectory  satisfies  the  uncertainty  principle.  The  consistency 
of  the  optimal  quantum  guidance  formulation  with  orthodox  quantum  mechanics  is  demonstrated 
in  Section  6,  where  the  statistical  distribution  of  an  ensemble  of  random  trajectories  computed  from 
Section  5  is  shown  to  reproduce  the  probability  density  functions  'P*(t,  x)V(t,  x)  exactly. 

2.  Optimal  control  approach  to  classical  and  quantum  mechanics 

In  this  section  we  will  survey  the  optimal  control  approach  to  classical  mechanics  and  examine  its 
applicability  to  quantum  mechanics.  It  is  well  known  that  classical  mechanics  can  be  formulated  as  an 
optimal  control  problem,  which  has  been  solved  in  two  different  ways:  the  variational  approach  and 
the  dynamic  programming  approach.  We  first  give  a  brief  review  of  the  two  approaches  and  then  con¬ 
sider  their  validity  in  quantum  mechanics.  Parallel  to  the  situation  that  the  variational  approach  was 
employed  to  give  an  optimal  control  formulation  of  classical  mechanics,  we  find  that  the  dynamic  pro¬ 
gramming  approach  can  be  employed  to  give  an  optimal  control  formulation  of  quantum  mechanics. 

Consider  a  dynamical  system  described  by  the  set  of  differential  equations, 

x=/(t,x,  u),  x(t0)=x0  (2.1) 

where  x  is  the  state  vector  and  u  is  the  control  vector,  which  is  to  be  determined  to  minimize  the 
following  cost  function, 


(2.2) 
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where  L(z,  x(r),  u(r))  is  the  instantaneous  cost  rate.  Classical  mechanics  can  be  converted  into  the 
above  optimal  control  problem  by  choosing  L( u(t))  as  the  Lagrangian  of  the  system  and 
choosing/(t,  x,  u)  =  u.  In  other  words,  classical  mechanics  is  itself  an  optimal  control  design  wherein 
an  optimal  velocity  u  =  x  is  determined  automatically  so  as  to  minimize  the  cost  function J (t0,  x0,  u). 
The  accompanying  Hamiltonian  is  defined  as 

H(t,  x,  u,  p)  =  pTx  -  L(t,  x,  u)  =  pTu  -  L(t,  x,  u),  (2.3) 

where  p  is  a  Lagrange  multiplier  to  be  determined.  The  variational  approach  gives  the  optimal  control 
law  as 


3  H  „  3  L  3  L 

—  =  0  =>  p  =  —  =  — 

3u  3u  3x 


(2.4) 


which  identifies  the  Lagrange  multiplier  p  as  the  generalized  momentum  defined  in  Lagrange 
mechanics.  Meanwhile,  the  variational  approach  derives  the  governing  equations  for  x  and  p  as 


x  = 


3  H 
3p ' 


P 


3  H 
3x 


(2.5) 


which  can  be  solved  to  give  the  optimal  trajectory  x(t)  and  p(t). 

The  optimal  control  u(t)  =  x(t)  obtained  from  the  variational  approach  is  a  function  of  time,  which 
is  valid  only  for  the  system  starting  from  a  specified  initial  state  x(0).  This  type  of  optimal  control  is 
called  open-loop  optimal  control,  because  for  other  initial  conditions,  the  optimal  control  has  to  be 
recomputed  accordingly.  In  1957,  Bellman  proposed  an  alternative  approach  to  optimal  control,  which 
is  valid  for  all  initial  states.  Bellman  wrote  the  following  in  his  book  [13]:  “In  place  of  determining  the 
optimal  sequence  of  decisions  from  the  fixed  state  of  the  system,  we  wish  to  determine  the  optimal 
decision  to  be  made  at  any  state  of  the  system.  Only  if  we  know  the  latter,  do  we  understand  the 
intrinsic  structure  of  the  solution”.  The  approach  realizing  this  idea,  known  as  dynamic  programming, 
leads  to  necessary  as  well  as  sufficient  conditions  for  optimality. 

Instead  of  the  problem  of  minimizing  J(t0,  Xo,  u)  for  given  t0  and  x0,  the  basic  idea  of  dynamic 
programming  is  to  consider  a  family  of  minimization  problems  associated  with  the  cost  functional 


J(t,x,  u) 


L(r,x(r),u(r))dr, 


(2.6) 


where  t  is  the  current  time,  which  may  be  any  instant  between  t0  and  ts,  and  x  =  x(t)  is  the  current 
state.  The  cost  function  J(t,x,  u)  computes  the  cost  accumulated  from  the  current  time  t  to  the 
terminal  time  ts  and  is  recognized  as  the  cost-to-go.  The  dynamic  programming  approach  aims  to 
find  the  optimal  control  u  =  x  to  minimize  the  cost-to-go  J(t,  x,  u)  for  arbitrary  current  time  t  and 
current  state  x.  Under  the  action  of  optimal  control,  the  optimal  cost-to-go  function  is  defined  as 


V(t,x)=  min  J(t,x,  u), 


(2.7) 


Bellman  showed  that  the  optimal  momentum  p  achieving  the  minimum  cost  function  can  be 
expressed  analytically  as 


The  most  remarkable  result  of  dynamic  programming  approach  is  to  characterize  the  partial 
differential  equation  satisfied  by  V(t,  x), 


dV[t,X)  =  H(t,  x,  p)|p=— 3v/3x*  (2.9) 

at 

When  expressed  in  terms  of  the  action  function  S  =  —V,  Eq.  (2.9)  becomes  the  Hamilton-Jacobi 
equation,  which  was  originally  derived  in  the  1830s  as  a  consequence  of  canonical  transformation. 
Bellman  derived  equation  (2.9)  in  1957  by  the  concept  of  dynamic  programming  instead  of  canonical 
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transformation,  and  Eq.  (2.9)  is  now  commonly  referred  to  as  the  Hamilton-Jacobi-Bellman  (HJB) 
equation. 

In  other  words,  Bellman  employed  the  dynamic  programming  approach  to  solve  the  problem  of 
least  action,  which  was  solved  traditionally  by  the  variational  approach.  The  variational  approach 
is  to  find  the  least  action  applied  between  two  fixed  points  (t0,  x0)  and  (t/,  X/),  while  the  dynamic 
programming  approach  is  to  find  the  least  action  applied  between  a  fixed  initial  point  (t0,  x0)  and  a 
varying  terminal  point  (t,  x),  i.e., 


S(t,x)  = 


L(r,  x(r),  u(r))dr. 


(2.10) 


The  resulting  action  function  S(t,  x)  can  be  evaluated  at  arbitrary  t  and  x  to  give  the  least  action  from 
(t0,  x0)  to  (t,  x).  The  comparison  of  the  least  action  function  S(t,  x)  in  Eq.  (2.10)  with  the  optimal  cost 
function  V  (t,  x)  in  Eqs.  (2.6)  and  (2.7)  reveals  that  the  time  t  appears  as  the  upper  limit  of  integration 
in  S(t,  x)  but  as  the  lower  limit  in  V(t,  x).  The  opposite  direction  of  time  integration  accounts  for  the 
fact  that  the  two  partial  differential  equations  governing  S  and  V  become  identical  if  we  set  S  =  —V, 
as  shown  in  Eq.  (2.9). 

In  comparison  with  the  open-loop  optimal  control  expressed  by  Eq.  (2.5),  which  gives  the  optimal 
time  trajectories  x(t)  and  p(t)  emerging  from  the  specific  x(0)  and  p(0),  the  optimal  control  given  by 
Eq.  (2.8)  is  a  closed-loop  strategy,  which  is  solely  based  on  the  current  time  t  and  the  current  state  x 
no  matter  where  the  current  state  emerges  from. 

In  the  absence  of  noise,  it  can  be  shown  that  the  open-loop  optimal  control  given  by  the  variational 
approach  and  the  closed-loop  optimal  control  given  by  the  dynamic  programming  approach  are 
equivalent.  However,  in  the  presence  of  noise,  the  variational  approach  is  invalid,  because  it  is 
no  longer  possible  to  predict  the  unique  optimal  time  trajectories  x(t)  and  p(t)  for  a  given  initial 
condition.  As  to  the  dynamic  programming  approach,  it  remains  valid  under  the  action  of  noise,  since 
its  strategy  is  only  relevant  to  current  time  and  current  state.  It  is  this  feature  that  allows  the  dynamic 
programming  approach  to  be  applicable  to  both  deterministic  and  stochastic  systems.  In  the  following, 
we  will  show  that  quantum  mechanics  can  be  derived  from  the  optimal  control  of  a  stochastic  system 
by  using  dynamic  programming  approach. 

When  an  exogenous  noise  dw  is  applied  to  the  deterministic  system  (2.1),  it  becomes  a  stochastic 
system 

dx  =f(t,  x,  u)dt  +g(x,  u)dw,  (2.11) 

where  w  =  [w\  w2  ■  ■  ■  w„]T  is  the  normalized  Wiener  process  satisfying 

E(dWj)  =  0,  j  =  1,  2 . n;  E(dwjdwk )  =  Sjkdt,  j,k=  1,  2, . . . ,  n.  (2.12) 

The  stochastic  optimal  control  problem  is  to  find  the  guidance  law  u  =  u(t,  x)  to  minimize  the  cost- 
to-go  function  (2.6).  However,  due  to  the  action  of  noise,  the  value  of  the  cost-to-go  function  is  random 
and  only  its  expectation  can  be  minimized, 


J(t,  x,  u)  =  Et>x  ^Jf  L(t,  x(r),  u(r))dr  J  .  (2.13) 

The  subscript  on  the  expectation  value  is  to  remind  us  that  the  expectation  is  over  all  stochastic 
trajectories  that  start  from  x(t)  =  x. 

The  optimal  cost-to-go  function  V(t,  x)  is  still  defined  by  Eq.  (2.7)  but  with  J(t,  x,  u)  now  given  by 
Eq.  (2.13).  Following  the  approach  of  dynamic  programming,  the  main  task  is  to  establish  the  partial 
differential  equation  for  V(t,  x).  For  this  purpose,  we  partition  the  integration  in  Eq.  (2.7)  into  two 
parts  and  rewrite  V(t,  x)  as 


V(t,x) 


a*-ir 


I(T,x(T),U(T))dT 


f. 


L(r,x(r),u(r))dT 


(2.14) 
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According  to  the  definition  of  V(t,  x),  the  second  term  in  the  right-hand  side  is  just  the  cost-to-go 
starting  from  t  +  dt,  i.e.,  V(t  +  dt,  x(t  +  dt)).  Therefore,  we  obtain  a  differential  equation  for  V(t,  x)  as 


V(t,x)  =  min  Et<x\V(t  +  dt,x(t  +  dt))  +  L(z,  x(r),  u(r))dr  | 


(2.15) 


The  optimal  control  u(r,  x(r))  in  Eq.  (2.14)  has  to  be  determined  over  the  interval  t  <  r  <  f/,  while 
in  Eq.  (2.15)  only  the  instantaneous  value  of  u(t,  x(r))  at  t  =  t  is  requested  to  minimize  the  cost 
function  as  dt  — >  0.  The  optimal  cost-to-go  function  V(t,  x)  now  appears  on  the  both  sides  of  Eq. 
(2.15).  In  order  to  solve  V(t,  x)  from  Eq.  (2.15),  we  expand  V(t  +  dt,  x(t  +  dt))  with  respect  to  V(t,  x) 
up  to  the  first  order  of  dt, 


V(t  +  dt,x(t  +  dt))  =  V(t,x)  + 


W(t,xK  ,  dV(t,  x) 


(2.16) 


where  d2V/dx2  denotes  the  Hessian  matrix  of  V  with  its  (j,  k)  entry  given  by  d2V /dXjdXk-  Substitut¬ 
ing  the  above  expansion  into  Eq.  (2.15)  and  evaluating  all  the  expectations  at  the  instant  z  =  t  and 
x(t)  =  x  as  dt  —>  0,  we  obtain  the  following  results: 


T  d2V(t,  x)  ^ 


(  Td2V  \  ,  /  _  _92v  \  /  T  Td2V  \ 

=  V  a„2-0dt  + 2E^  dw  8  ^  dw  g  —gdw 


(  Td2V  \  (  Td2V  \  , 

=  tr(S  M8)dt+\f  Mf)dt 


L(z,x(z),  u(r))dt 


]  =  L(t,  x 


(2.17) 


(2.18) 


(2.19) 

(2.20) 


In  the  derivation  of  Eq.  (2.18)  we  have  used  the  statistical  properties  of  the  Wiener  process  as  stated  in 
Eq.  (2.11).  Eq.  (2.18)  shows  that  a  first-order  term  of  dt  appears  in  the  second-order  expansion  of  dx, 
and  this  fact  explains  why  we  retain  the  second-order  term  of  dx  in  the  expansion  of  V  (t+dt,  x(t+dt)) 
in  Eq.  (2.16). 

With  the  expectations  computed  by  Eq.  (2.17)  to  Eq.  (2.20),  Eq.  (2.15)  in  the  limit  of  dt  ->  0 
becomes  the  stochastic  Hamilton-Jacobi-Bellman  (HJB)  equation  [15-17] 


dV(t,  x)  f  dV(t,x) 

- =  min  { L(t,  x,  u)  -I - f(t,  x,  u) 

dt  uev  [  dx 

1  T  T  d2V(t,x)  1] 

+2ns(x’u)^ig(x’u)JJ’ 
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whose  solution  gives  the  optimal  cost-to-go  function  V(t,  x).  The  minimization  in  Eq.  (2.21)  is  taken 
over  the  set  U,  which  comprises  all  the  possible  instantaneous  values  of  u  at  the  current  time  t.  Let 
u*  =  u*(t,  x,  V([,  x))  be  the  optimal  u  attaining  the  minimum  involved  in  Eq.  (2.21),  where  the  values 
of  V(t,  x)  and  its  derivatives  are  fixed  at  the  current  time  t  and  the  current  state  x.  With  u  replaced 
by  u*(t,  x,  V(C,  x)),  Eq.  (2.21)  becomes  a  partial  differential  equation  for  V(t,  x), 


t,u*)  +  \  jgT(x,u*) 


Once  a  solution  V*(t,  x)  is  solved  from  Eq.  (2.22),  the  optimal  control  is  eventually  determined  as 
u*(t,  x)  =  u(t,  x,  V*([,  x)),  which  gives  the  optimal  control  strategy  at  any  instant  t  and  any  position 
x.  For  a  special  choice  of/(t,  x,  u)  and  g(x,  u),  the  above  process  amounts  to  an  optimal  stochastic 
control  approach  to  quantum  mechanics  and  Eq.  (2.22)  turns  out  to  be  the  Schrodinger  equation. 

Before  we  discuss  the  specific  application  to  quantum  mechanics,  we  consider  a  parallel  example 
to  familiarize  the  above  process  of  optimization.  A  one-dimensional  stochastic  system  is  described  by 


dx  =  (x  +  u)dt  +  (2.23) 

where  df  is  a  Wiener  process  with  mean  (df )  =  0  and  variance  (df2)  =  dt.  The  control  u  is  to  be 
determined  to  minimize  the  quadratic  cost  function, 


J(t,x,  u)=£t,xj^V  +  u2)dTj. 

ments  f  =  x+u,g  =  ^/ i 
antes 

-  =  min  (  (x2  +  u2)/ 2  +  dV(f’*\x  +  u)  -| 
ueu  |  3x 


With  the  replacements/  =  x  +  u,g  =  and  L  =  (x2  +  u2)/ 2,  the  HJB  equation  (2.21)  for  the 
present  case  becomes 


vm,x)i 
2  3x2  J 


The  control  u  minimizing  the  sum  within  the  brace  is  found  as 


u* 


3  V(t,x) 
dx 


(2.26) 


which  in  turn  is  substituted  back  into  Eq.  (2.25)  to  from  a  partial  differential  equation  for  V(t,  x), 


dV_vd2V  1 /3V\2  3V  x2 
~dt  ~  2  Sx2"  ~  2  \  ~dx)  +X3x+~2' 


(2.27) 


According  to  the  structure  of  the  above  equation,  the  solution  of  V(t,  x)  assumes  the  following  form 


V(t,  x)  =  1 p(t)x 2  +  a(t)x  +  m-  (2.28) 

Substituting  this  solution  into  Eq.  (2.27)  and  equating  the  coefficients  of  the  same  powers  of  x  on  the 
both  sides,  we  obtain  the  governing  equations  forp(t),  a(t)  and  /(t)  as 


P  =  P2  —  2p  —  1,  a  =  pa  —  a,  P  =  (-vp  +  a2)/ 2.  (2.29) 

The  optimal  control  is  synthesized  by  combining  Eqs.  (2.26)  and  (2.28)  as 


»ct.  x)  =  -  dV^  X)  =  -p(t)x  -  a(t)  (2.30) 

where  the  time  functions  p(t)  and  a(t)  are  solved  from  Eq.  (2.29).  The  same  procedures  adopted  in 
this  example  will  be  used  in  the  next  section  to  derive  quantum  mechanics  with  a  different  choice  for 
f,g  and  L. 
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3.  Optimal  quantum  guidance  law 


Quantum  mechanics  is  a  consequence  of  the  above  optimal  control  design  for  the  stochastic  system 
(2.1 1 )  with/(x,  u)  =  u  and  g(x,  u)  =  y/vl,  i.e., 


dx  =  udt  +  v^dw,  (3.1) 

where  v  is  a  diffusion  coefficient  to  be  assigned.  Taking  expectation  of  both  sides  of  Eq.  (3.1)  with  the 
property  E  (dw)  =  0,  we  have  x  =  u.  In  other  words,  the  guidance  command  u  determines  the  mean 
velocity  of  the  particle  so  as  to  minimize  the  cost  function  (2.13).  With  the  replacements /(x,  u)  =  u 
andg(x,  u)  =  ^/vl,  the  stochastic  HJB  equation  (2.21)  appears  as 

-  =  mn  |l(t,  x,  u)  +  (W(t,  x))r  u  +  V2V(t,  x) ) .  (3.2) 

The  optimal  u  achieving  the  minimum  inside  the  brace  for  fixed  t  and  x  is  determined  by  the  condition 


3 L(t,  x,  u) 

3u 


—VV(t,  x). 


(3.3) 


We  recall  that  the  right-hand-side  terms  in  Eq.  (3.2)  have  been  evaluated  by  taking  expectation  and  the 
relations  x  =  u  and  L(t,  x,  u)  =  L(t,  x,  x)  have  been  applied.  In  the  case  of  quantum  mechanics,  the 
instantaneous  cost  rate  L(t,  x,  x)  is  chosen  as  the  Lagrangian  of  the  particle.  With  this  particular  choice 
of  I(t,  x,  x),  the  mean  momentum  of  the  particle  is  given  by  its  definition  in  Lagrange  mechanics 


9L(t,  x,  x) 

3x 


(3.4) 


The  combination  of  Eqs.  (3.3)  and  (3.4)  yields  an  expression  of  p  in  terms  of  the  optimal  cost-to-go 
function  V(t,  x)  as 


3 L(t,  x,  x)  3 L(t,  x,  u) 


from  which  the  optimal  guidance  command  u*  can  be  determined  as  a  function  of  t,  x  and  p, 


U*  =U(t,X,  P)|p=— w. 

Further  simplification  can  be  made  by  introducing  the  Hamiltonian 


(3.6) 


H(t,  x,  p)  =  pTx  -  L(t,  x,  x)  =  pTu  -  L(t,  x,  u).  (3.7) 

With  the  substitutions  of  optimal  u  from  Eq.  (3.5)  and  the  Hamiltonian  from  Eq.  (3.7),  the  Stochastic 
HJB  equation  (3.2)  turns  out  to  be 


^  +  H(t,  x,  p)|p=vs  +  ^V2S  =  0, 
at  2. 

where  S  is  the  action  function  defined  as 


(3.8) 


S(t,  x)  =  —V(t,  x).  (3.9) 

For  a  particle  with  mass  m  moving  under  the  action  of  a  potential  U(t,x),  the  Lagrangian  L(t,  x,  x)  has 
a  simple  expression: 

L(t,  x,  x)  =  -xTx  -  U(t,  x),  p  =  m’  X,X)  =  mx.  (3.10) 

2  3x 

The  corresponding  Hamiltonian  H(t,  x,  p)  defined  in  Eq.  (3.7)  reads 
H(t,x,  P)  =  -Vp+U(t,x). 


(3.11) 
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According  to  the  value  assigned  to  the  constant  v,  Eq.  (3.8)  has  three  different  physical  meanings. 

(A)  v  =  0  : 

Eq.  (3.8)  for  this  case  gives  the  familiar  result 

+  H(t,  x,  p)|p=vs  =  f§  +  -J-  (VS)r  (VS)  +  U(t,  x)  =  0,  (3.12) 

at  dt  2m 

which  is  the  classical  Hamilton-Jacobi  (H-J)  equation.  For  this  case,  the  above  derivation  amounts 
to  the  dynamic  programming  approach  to  classical  mechanics  [17], 

(B)  v#0,  v  G  R : 

The  transformation  S(t,  x)  =  mv  In &(t,  x)  leads  Eq.  (3.8)  to 
d&  , 

—  =oiV20  -  (3.13) 

3t 

which  is  a  heat  diffusion  equation  for  the  temperature  difference  <P  =  T  —  Ts  with  surrounding 
temperature  Ts,  thermal  diffusivity  a  =  —v/2  and  radiation  loss  /i<P  =  (U /mv)@. 

(C)  v  #  0,  v  G  C: 

This  is  the  case  corresponding  to  quantum  motion.  The  specific  value  of  the  complex  v  to  be 
considered  is 

v  =  -m/m,  (3.14) 

where  ft  is  Planck  constant.  The  significance  of  this  complex  diffusion  coefficient,  which  appeared 
earlier  in  the  works  of  Furth  [6]  and  Comisar  [9],  lies  in  its  connection  between  the  stochastic  HJB 
equation  and  Schrodinger  equation.  Inserting  this  value  of  v  into  Eq.  (3.8),  we  obtain  a  quantum 
version  of  the  classical  H-J  equation  (3.12) 

^  +  z1-  (VS)T  (VS)  +  U(t,  x)  -  ^-V2S  =  0.  (3.15) 

9t  2m  2m 

With  a  further  transformation 

S(t,x)  =-ift  In  <P(t,x),  (3.16) 

we  obtain  the  Schrodinger  equation 
dV  ft2  , 

ifi —  = - V2'#'  +  U(t,  x)0.  (3.17) 

3t  2m 

Up  to  this  stage,  we  have  proved  that  Schrodinger  equation,  the  kernel  equation  in  quantum 
mechanics,  is  just  the  stochastic  HJB  equation  (2.21)  when  we  solve  a  stochastic  optimal  guidance 
problem  described  by  Eq.  (2.11)  with/(x,  u)  =  u,  g(x,  u)  =  ^/—ift/ml  and  L  =  (m/2)xTx  -  U(t,  x). 
In  terms  of  the  Schrodinger’s  ansatz  (3.16),  we  obtain  an  interesting  link  between  the  optimal  cost 
function  V(t,x)  and  the  wavefunction  0(t,  x): 

V(t,x)  =  ih\n0(t,  x)  (3.18) 

which  symbolizes  the  proposed  interpretation  of  quantum  mechanics  as  an  optimal  control  problem. 

The  next  issue  is  to  clarify  the  relation  between  the  optimal  guidance  law  u  and  the  wavefunction 
0(t,  x)  and  to  confirm  the  de  Broglie-Bohm  postulate  that  <f/(t,  x)  is  the  pilot  wave  guiding  the 
motion  of  a  quantum  particle.  The  optimal  guidance  command  u,  which  determines  the  mean  velocity 
of  the  particle,  is  given  by  Eqs.  (3.5)  and  (3.10) 

u_A_P_  1  3V(t,x)  _  1  3S(t,x)  _  ift  1  3ff(t,x) 

U  X  m  m  3x  m  3x  m  0  3x 

As  expected,  the  particle’s  mean  velocity  is  truly  guided  by  the  wavefunction  0(t,  x).  The  guidance 
law  (3.19)  reproduces  Bohm’s  postulate  (1.1)  from  the  viewpoint  of  stochastic  optimal  guidance  law 
design.  However,  the  two  guidance  laws  given  by  Eqs.  (1.1 )  and  (3.19)  are  not  exactly  the  same  due  to 
their  slight  difference  in  the  definition  of  function  S.  In  Eq.  ( 1.1 ),  SB  is  a  real  function  determined  by  the 
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phase  part  of  =  RB(t,  x)e'Sll(t-x,/'n,  while  in  Eq.  (3.19),  S  is  a  complex  function  related  to  the  complex 
wavefunction  via  In  the  literature,  Eq.  (3.19)  was  derived  alternatively  by  quantum 

Hamilton  mechanics  [14,18]  and  was  recognized  as  a  complex  version  of  Bohm’s  proposal  [19]. 
The  real-valued  quantum  trajectory  determined  by  Eq.  (1.1)  led  to  a  controversy  conclusion  that 
a  particle  is  motionless  in  all  its  eigenstates.  This  difficulty  could  be  conquered  by  the  complex 
quantum  trajectory  x(t)  solved  from  Eq.  (3.19),  which  has  been  recently  developed  into  a  potential 
computational  tool  to  analyze  various  quantum  effects  [20,21], 

The  complex  velocity  derived  in  Eq.  (3.19)  has  a  close  relation  to  the  Brownian  motion  addressed  by 
Nelson  [8],  where  the  current  velocity  (mean  velocity)  vN  and  the  osmotic  velocity  uN  was  introduced 
in  terms  of  the  gradients  of  two  functions  SN  and  RN  as 

vN  =  -VSN,  uN  =  -VRn.  (3.20) 

m  m 

Nelson  showed  that  the  function  formed  by  V  =  eRN+iSN  is  a  solution  to  the  Schrodinger  equation.  If 
the  Nelson's  twin  velocities  vN  and  uN  are  combined  to  form  a  complex  velocity, 

vN  -  iuN  =  —V(SN  -  iRN)  =  —  —  Vln<P  =  x,  (3.21) 

m  m 

we  find  that  the  result  is  just  the  complex  velocity  derived  in  Eq.  (3.19).  In  other  words,  the  real  and 
imaginary  parts  of  the  optimal  velocity  command  u  =  x  are  the  twin  Brownian  velocities  (vN,  — uN) 
defined  in  Nelson’s  work,  and  are  also  identical  to  the  two  velocity  components  (v,  —  |(t))  as  defined 
in  Eq.  (1.2)  by  Bohm  [5], 

4.  Quantum  Hamilton  mechanics 

In  the  last  section  we  have  shown  that  the  quantum  motion  of  a  particle  is  guided  by  its  accompa¬ 
nying  pilot  wave  in  an  optimal  manner  such  that  the  guided  motion  minimizes  the  Lagrangian  cost 
function  in  the  presence  of  a  stochastic  Wiener  process.  The  optimal  pilot  wave  is  found  to  be  the 
wavefunction  t ,  x)  solved  from  the  Schrodinger  equation.  Instead  of  using  the  optimal  guidance 
approach,  the  same  quantum  guidance  law  has  been  derived  by  quantum  Hamilton  mechanics  [14], 
In  this  section  we  will  go  one  step  further  to  show  that  even  quantum  Hamilton  mechanics  itself  can  be 
derived  from  the  proposed  stochastic  optimal  control  approach,  just  as  classical  Hamilton  mechanics 
could  be  derived  from  deterministic  optimal  control  approach. 

Our  aim  is  to  establish  the  Hamilton  equations  of  motion  under  the  action  of  the  above  optimal 
guidance  law  and  meanwhile  to  find  a  quantum  version  of  the  classical  Hamiltonian  (3.11).  The  first 
set  of  Hamilton  equations  deals  with  x,  which  has  been  derived  in  Eq.  (3.19),  while  the  second  set 
deals  with  p,  whose  components  can  be  expressed  as 

dpk  _  d  dS(t,  x)  _  d2S(t,  x)  JL,  92S(t ,  x) . 
dt  dt  dXh  dtdxk  — '  dXjdXk  J’ 

where  Xj  is  given  by  Eq.  (3.19).  As  to  the  first  term  on  the  right-hand  side,  we  employ  the  quantum 
H-J  equation  (3.15)  to  give  an  expression  for  dS/dt, 


(4.2) 


The  substitution  of  Eqs.  (3.19)  and  (4.2)  into  Eq.  (4.1)  yields  the  second  set  of  Hamilton  equations 


(4.3) 
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The  inspection  of  the  structure  in  the  Hamilton  equations  (3.19)  and  (4.3)  motivates  us  to  define  the 
quantum  Hamiltonian  as 

H*(t,  x,  p)  =  ^pTp  +  U(t,  x)  -  ^V2S(t,  x) 

=  — pTp  +  U(t,  x)  -  —V2  In  i A(t,  x).  (4.4) 

2m  2m 

In  terms  of  H&,  Eqs.  (3.19)  and  (4.3)  can  be  incorporated  into  a  standard  form  of  Hamilton  equations 


dx  3H^(t,x,  p)  p 
dt  3p  m’ 


dp  _  3H#(t,x,  p) 

dt  dx 


d_ 

dx 


U(t,  x) - V2ln<A(t,x) 


(4.5a) 

(4.5b) 


This  is  the  closed-loop  mean  dynamics  of  a  quantum  particle  under  the  action  of  the  optimal  guidance 
law.  The  usage  of  the  notation  Hq,  is  to  emphasize  the  dependence  of  the  closed-loop  dynamics  on  the 
optimal  cost  function  V  =  ift  In  !A.  This  property  is  also  called  state-dependence  in  the  terminology  of 
quantum  mechanics.  For  a  given  quantum  state  described  by  the  wavefunction  V,  Eq.  (4.5)  provides 
the  governing  equations  for  a  particle  moving  in  the  quantum  state  V.  It  appears  that  besides 
serving  as  a  probability  density  function,  a  solution  'I'  to  the  Schrodinger  equation  also  intrinsically 
determines  a  set  of  Hamilton  equations  (4.5),  which  forms  a  complex  dynamical  representation  of  the 
quantum  state  V.  The  last  term  in  Eq.  (4.4)  is  known  as  the  complex  quantum  potential, 


(4.6) 


It  can  be  seen  that  without  the  participation  of  this  quantum  potential  the  quantum  Hamilton  equation 
(4.5)  reduces  to  the  classical  Hamilton  equations. 

One  of  the  merits  of  quantum  Hamilton  mechanics  consists  in  the  expression  of  a  quantum 
operator  A  by  its  complex  representation  A  via  the  relation 


A  =  (4.7) 

In  quantum  Hamilton  mechanics,  every  physical  quantity  A  can  be  expressed  as  a  complex  function 
A(x,  p)  of  the  canonical  variables  x  and  p  satisfying  the  Hamilton  equation  (4.5),  and  its  operator 
representation  A  can  be  identified  from  Eq.  (4.7).  Using  this  relation,  we  can  give  a  formal  proof  of  the 
quantization  axiom  p  —  iftV  that  governs  the  critical  transition  from  classical  systems  to  quantum- 
mechanical  systems.  The  canonical  momentum  p  has  been  derived  by  the  optimal  guidance  law  (3.19) 
as 


p  =  VS  =  — ifiV  In  V  =  ^  (— iftV)  9.  (4.8) 

The  comparison  of  the  above  equation  with  the  definition  p  =  (l/>A)piA  gives  the  canonical 
momentum  operator  as 

p  =  — iftV.  (4.9) 

We  say  that  the  momentum  function  p  =  VS  =  —  iftV  In  is  the  complex  representation  of  the 
momentum  operator  p.  Similarly,  the  Hamiltonian  operator  H  can  be  determined  by  its  complex 
representation  /-fy  given  by  Eq.  (4.4), 
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The  comparison  with  the  definition  H  =  (\/'P)H'P  yields  the  Hamiltonian  operator 


H  =  —  - — V2  +  V  =  —  p2 


(4.11) 


For  any  complex  variable  A  defined  in  quantum  Hamilton  mechanics,  we  can  identify  its 
accompanying  operator  A  by  using  Eq.  (4.7).  As  a  representative  example,  we  demonstrate  how 
to  obtain  the  angular  momentum  operator  L  from  its  complex  realization  L.  In  quantum  Hamilton 
mechanics,  the  expression  for  L  =  x  x  p  is  the  same  as  that  in  classical  mechanics  but  with  x  =  [ xyz ] 
and  p  =  [px  py  pz]  satisfying  the  quantum  Hamilton  equation  (4.5),  instead  of  the  classical  Hamilton 
equations.  Evaluating  the  x  component  of  L  with  p  given  by  Eq.  (4.8),  we  obtain 


Lx  =  yPz  ~  zpy 


/  ft  3  In  \  /  ft  3  In  \  ifi  /  3 


Comparing  the  above  equation  with  the  definition  Lx  =  (1/'P)LX*P  gives 


(4.12) 


Lx  =  — i  ft 


_3__  d_\ 

3 z  Z3 yj 


=  yi>z  -  ZPy, 


(4.13) 


where  we  have  used  the  expressions  for  py  and  pz  derived  in  Eq.  (4.9).  The  other  two  components 
Ly  =  zpx  —  xpz  and  Lz  =  xpy  —  ypx  can  be  derived  similarly. 


5.  Quantum  trajectories  under  optimal  guidance  law 


Quantum  Hamilton  mechanics  considers  only  the  mean  motion  of  a  quantum  particle,  because  the 
instantaneous  random  motion  has  been  averaged  out  during  the  process  of  taking  expectation  in  Eq. 
(2.13).  The  actual  quantum  dynamics  is  described  by  the  stochastic  differential  equation  (3.1)  with 
optimal  guidance  command  u  given  by  Eq.  (3.19)  and  v  given  by  Eq.  (3.14), 


dx  =  u(t,  x)dt  +  *Jvdw  =  -  — +  J  —  (— 1  +  i)dw,  (5.1) 

m  dx  V  2m 

where  for  notational  simplicity  only  one  component  of  the  vector  x  is  displayed  above;  equations 
for  remaining  components  can  be  expressed  in  a  similar  way.  Eq.  (5.1)  indicates  that  a  particle’s 
displacement  dx  comprises  the  drift  displacement  u(t,  x)dt  and  the  random  diffusion  displacement 
*Jvdw. 

Eq.  (5.1)  defines  a  Langevin  equation,  where  u(t,  x)  is  in  general  a  nonlinear  function  of  x.  We  can 
numerically  integrate  such  a  nonlinear  Langevin  equation  by  using  the  Euler-Maruyama  method  [22] 
with  a  fixed  time  step  At, 


-(-1  +  OfVZt,  fa  0,1,. 


(5.2) 


where  ~/At  stems  from  the  standard  deviation  of  the  Wiener  process  dw  and  £  is  a  real-valued  random 
variable  with  standard  normal  distribution  N(0,  l),i.e.,E(f)  =  0  and  =  1.  The  partition  of  Eq.  (5.2) 
into  real  and  imaginary  parts  yields  two  coupled  difference  equations: 


ft  /dln^(t,-,x,)\  'pfi  ,  .  .  , 

Re(Xj+i)  =  Re(Xj)  +  -Im  ( - J  At  -  ^  —% sfAt,  j  =  0,  1 , . . . ,  n, 

Im(xJ+i)  =  Im(Xj)  -  —Re  (  X]'  ^  At  +  J — %VAt,  j  =  0, 1, . . . ,  n. 

m  \  dx  )  V  2m 


(5.3a) 


(5.3b) 


Given  an  initial  position  x0,  infinitely  many  quantum  trajectories  can  be  generated  by  Eq.  (5.3)  due 
to  the  randomness  imposed  by  the  random  variable  f .  Upon  solving  Eq.  (5.3),  the  real  part  and  the 
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imaginary  part  ofx  are  coupled  and  have  to  be  solved  simultaneously.  However,  only  the  real  part  ofx 
is  recorded  for  later  use,  because  in  the  probability  interpretation  of  quantum  mechanics,  x  is  treated 
as  a  real-valued  variable.  Our  aim  below  is  to  confirm  that  the  statistical  distribution  of  the  real  part  of 
the  quantum  trajectories  solved  from  Eq.  (5.3)  coincides  with  the  probability  density  function  (PDF) 
1 U*W,  and  at  the  same  time  to  reveal  the  physical  meanings  of  the  imaginary  part  ofx. 

Before  we  compute  the  quantum  trajectory  Xj  from  Eq.  (5.3),  the  optimal  cost  function  V(t,  x)  = 
— S(t,  x)  =  ifi  In ' P ( t ,  x)  has  to  be  solved  in  advance  from  the  quantum  H-J  equation  (3.15)  or  from 
the  equivalent  Schrodinger  equation  (3.17).  Here  we  will  consider  a  Gaussian  wave  oscillating  in  a 
harmonic  potential  U(x)  =  (k/ 2)x2.  The  Schrodinger  equation  with  an  initial  Gaussian  packet 

<P(0,  x)  =  exp  [— (x  -  ip0)2/2] ,  (5.4) 

can  be  solved  analytically  (in  dimensionless  form)  as 

>£(t,  x)  =  7r-1/4e-po/2  exp  i(x  -  ip0  cos  t)2  +  xp0  sint  -  ^  sin(2t)j  .  (5.5) 

This  wavefunction  represents  a  coherent  state  [23]  known  for  the  remarkable  property  that  its  mean 
motion  behaves  exactly  as  a  classical  harmonic  oscillator.  The  probability  density  functions  in  the 
coordinate  space  and  momentum  space  are  given,  respectively,  by 

<£*> P*(t,x)  =  — J-e~(*~Po  sint)\  &*%([,  p)  =  — e-(p-p°cost)2,  (5.6) 

y/n  s/n 

from  which  the  mean  values  and  the  standard  deviations  of  x  and  p  are  computed  as 

£(x)  =  p0  sin  t,  ox  =  y/y2\  E(p)=p0cost,  trp  =  Jyi-  (5.7) 

The  above  wavefunction-based  statistics  can  be  reproduced  from  the  mean  quantum  dynamics 
obtained  by  substituting  Eq.  (5.5)  into  Eq.  (3.19), 

x  =  e-itpo  +  ix.  (5.8) 

Solving  Eq.  (5.8)  with  initial  condition  x(0)  =  0  +  iS,  S  e  R,  yields 

xa(t)  =  posint  —  <5(sint  —  icost),  ps(t)  =  po  cos  t  —  <$(cos  t  —  isint).  (5.9) 

If  we  let  S  =  0  in  Eq.  (5.9),  then  the  mean  quantum  dynamics  recovers  the  mean  values  listed  in  Eq. 
(5.7),  i.e.,  x0(t)  =  £(x)  =  p0  sin  t  and  p0(t)  =  E(p)  =  p0  cos  t.  Subtracting  Eq.  (5.9)  from  their  mean 
values,  we  get  the  position  and  momentum  uncertainties  as 

<4x  =  |xa(t)  -posint|  =  |<$| ,  Ap  =  |ps (t)  —  p0  cos  t|  =  |<$| .  (5.10) 

The  above  relation  characterizes  the  physical  meanings  of  the  imaginary  part  S  =  lm(x(0))  as  the 
quantum  uncertainty  Ax  and  Vp.  The  comparison  between  Eqs.  (5.7)  and  (5.10)  indicates  that  with 
<$  =  lm(x(0))  =  s/\/2  the  probability-based  uncertainties  ax  and  erp,  and  the  trajectory-based 
uncertainty  Ax  and  Vp  become  identical.  The  product  of  the  quantum  certainties  Ax  and  Ap  yields 

AxAp  =  |xs(r)  -posint  |  •  |pa(t)  —  Po  cos  t|  =  1/2,  (5.11) 

which  shows  that  the  coherent  state  achieves  the  lower  bound  of  the  uncertainty  principle  (in 
dimensionless  form).  This  is  a  well-known  result  proved  before  by  using  the  PDFs  <Px*ipx  and  <Pp*if'p, 
but  here  we  obtain  the  same  result  by  the  complex  dynamic  model  for  x(t)  and  p(t). 

We  have  seen  that  the  mean  quantum  trajectory  x0(t)  and  the  boundary  trajectory  Xg(t)  can  be 
solved  from  the  mean  quantum  dynamics  (3.19).  On  the  other  hand,  actual  quantum  trajectories  are 
found  by  solving  Eq.  (5.3)  with  wavefunction  » P(t,  x)  given  by  Eq.  (5.5).  A  total  of  160,000  random 
trajectories  all  starting  from  x0  =  0  +  Oi  are  recorded  and  plotted  as  the  red  curves  in  Fig.  1(a)  and  (c) 
for  the  real  part  and  imaginary  part,  respectively.  The  numerical  averages  of  these  random  trajectories 
are  shown  in  Fig.  1(b)  and  (d).  For  the  purpose  of  comparison,  the  mean  trajectory  x0  determined  from 
Eq.  (5.9)  is  also  shown  in  Fig.  1(a)  and  (c)  as  the  blue  curves.  The  complete  coincidence  of  the  mean 


trajectory  x0  with  the  numerical  average  of  the  random  trajectories  confirms  the  fact  that  the  Hamilton 
equation  (4.5)  represents  the  mean  dynamics  of  the  random  quantum  motion  described  by  Eq.  (5.1). 

Apart  from  providing  a  mean  trajectory  for  random  quantum  motion,  a  complex  trajectory  solved 
from  Eq.  (5.8)  with  initial  position  x^(0)  =  0  +  Si  can  serve  as  a  lower  or  upper  bound  for  random 
quantum  trajectories.  The  complex  trajectory  Xj(t)  with  S  =  ±erx  satisfying 

l*dbar(t)  -  Xo(t)|  =  |x±a(t)  ~  Po  Sint|  =  0X  =  /Y/2,  (5.12) 

forms  a  lcr -deviation  boundary  that  encompasses  68.26%  random  trajectories  in  the  coherent  state 
IP,  as  shown  in  Fig.  2.  One  hundred  and  five  hundred  random  paths  are  generated,  respectively,  by  Eq. 
(5.3)  with  po  =  1.  It  can  be  seen  that  for  both  cases  almost  all  of  the  random  paths  are  enveloped  by 
the  3er  boundary  curve  x±3rr  (t),  which,  as  expected,  is  able  to  cover  99.73%  random  trajectories.  The 
chromatic  level  indicates  the  magnitude  of  the  PDF  whose  distribution  is  consistent  with  the 
concentration  and  dispersion  of  the  random  quantum  trajectories. 


6.  Reconstructing  PDF  by  optimal  quantum  trajectories 

We  have  seen  above  that  a  single  complex  trajectory  Xg(t)  solved  from  the  mean  quantum 
dynamics  (3.19)  forms  a  boundary  for  the  quantum  uncertainty  in  ' P.  In  this  section,  we  proceed 
to  show  that  the  superposition  of  an  ensemble  of  complex  random  trajectories  emerging  from  the 
same  initial  position  can  reproduce  the  statistical  distribution  prescribed  by  <P*<P.  To  verify  the  latter 
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Fig.  2.  The  boundary  of  quantum  uncertainty  for  a  coherent  state  withp0  =  t.  One  hundred  (part  a)  and  five  hundred  (part  b) 
random  paths  are  generated,  respectively,  by  Eq.  (5.3)  with  given  by  Eq.  (5.5).  The  la  and  3a  deviation  boundaries  (yellow 
lines)  computed  from  Eq.  (5.9)  cover,  respectively,  68.26%  and  99.73%  random  quantum  paths  (red  lines).  The  chromatic  level 
illustrates  the  magnitude  of  the  PDF  'P*'P  whose  value  is  consistent  with  the  concentration  and  dispersion  of  the  computed 
random  quantum  trajectories.  (For  interpretation  of  the  references  to  colour  in  this  figure  legend,  the  reader  is  referred  to  the 
web  version  of  this  article.) 

meaning  of  x(t)  via  the  example  of  the  coherent  state,  we  construct  the  point  set  {Re(Xj)},  which 
contains  N  complex  trajectories  Xj  solved  from  Eq.  (5.3),  and  then  compare  the  statistical  distribution 
of  this  point  set  with  the  distribution  of  *P*  ( t ,  x)^(t,  x)  given  by  Eq.  (5.6).  Different  sizes  of  ensemble 
of  the  trajectories  with  N  =  100,  1000,  10,000  and  100,000  are  used  to  construct  the  point  set 
{Re(Xj)},  whose  spatial  distributions  at  the  instant  t  =  10.8  are  shown  in  Fig.  3.  It  can  be  seen  that 
as  the  size  of  the  ensemble  increases,  the  spatial  distribution  of  the  point  set  {Re(Xj)}t=io.8  converges 
gradually  to  the  distribution  of  the  PDF  &*(t,  x)<^(t,  x)  evaluated  at  t  =  10.8. 

It  is  noted  that  all  the  random  trajectories  here  emerge  from  the  same  initial  position  x(0)  =  0+Oi. 
By  contrast,  the  quantum  trajectories  considered  in  the  existing  literature  start  from  different  initial 
positions  whose  distribution  is  deliberately  selected  in  order  to  generate  an  ensemble  of  trajectory 
compatible  with  x)^(t,  x).  The  stochastic  differential  equation  (5.1)  derived  from  the  optimal 
guidance  approach  automatically  generates  compatible  trajectories  no  matter  how  the  initial  posi¬ 
tions  of  the  trajectories  are  prepared. 

To  quantify  the  matching  error  between  the  position  distributions  of  {Re(x,)j  and  the  PDF 
>^*(t,  x)^x(t,  x),  x  e  R,  the  correlation  coefficient  r  between  them  is  calculated, 

Sab 

r  -  •  ^  .  i  <  r  <  i,  (6.i) 

where 

Sm  =  Y^(Aj-A)2,  Sbb  =  £(Bj-B)2,  Sab  =  XJ(Aj.-A)(Bj-B),  (6.2) 

with  data  Aj  obtained  from  the  point  set  {Re(Xj)}  and  Bj  from  the  PDF  'P*'PX.  We  can  see  from  Fig.  3 
that  the  correlation  coefficient  approaches  to  1  as  the  size  of  the  trajectory  ensemble  increases. 

The  spatial  distributions  of  {Re(Xj)}  observed  at  six  instants  t  =  2,  4,  6,  8,  10  and  12  with  a  fixed 
size  of  ensemble  N  =  100,000  are  shown  in  Fig.  4.  Because  all  the  trajectories  start  from  the  same  ini¬ 
tial  point  x0  =  0  +  Oi,  in  the  beginning  the  trajectories  do  not  fully  disperse  over  the  real  axis  and  the 
initial  distribution  of  (Re(Xj)}  is  very  different  from  the  distribution  of  'P*'PX  (green  line  in  Fig.  4).  As 
time  proceeds,  enough  spatial  points  are  reached  by  the  trajectories  so  that  the  distribution  of  {Re(Xj)} 
gradually  evolves  into  a  fixed  pattern  identical  to 

The  Wiener  process  w  in  the  stochastic  differential  equation  (5.1)  is  critical  to  the  guidance  inter¬ 
pretation  of  quantum  mechanics.  It  tells  us  how  to  generate  the  noise  such  that  the  resulting  random 
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trajectories  can  reproduce  the  statistical  property  of  4*.  In  Eq.  (5.2),  the  random  variable  f  is  added  to 
the  iteration  at  every  time  step  At.  As  At  ->  0,  the  trajectory  becomes  fractal  and  non-differentiable 
everywhere.  For  the  purpose  of  comparison,  we  remove  the  Wiener  process  from  Eq.  (5.1)  to  see 
whether  the  remaining  deterministic  differential  equation,  i.e.,  Eq.  (3.19),  can  produce  quantum 
motion  consistent  with  the  description  of  the  wavefunction  A  straightforward  discretization  of 
Eq.  (3.19)  yields 

%i  =  xj  ~  - - At ,  j  =  0, 1, . . . ,  n.  (6.3) 

For  a  given  initial  position  x0,  only  one  trajectory  can  be  generated  from  Eq.  (6.3).  Uncertainty  has  to 
be  imposed  on  x0  in  order  to  yield  an  ensemble  of  deterministic  trajectories.  The  distribution  of  initial 
positions  is  suggested  as 


x0  =  0  +  zW— ,  £«N(  0,1). 


(6.4) 


The  real  part  of  x0  is  still  fixed  at  zero,  but  its  imaginary  part  is  now  generated  by  the  random  variable 
£  as  suggested  by  Eq.  (5.3b).  This  arrangement  of  the  initial  condition  is  to  reflect  the  situation,  taking 
place  in  the  quantum  world,  that  even  though  we  have  precise  information  about  the  initial  position 
Re(x0)  and  the  governing  equation  (6.3),  the  resulting  trajectory  is  still  uncertain  due  to  the  influence 
of  the  unobservable  part  Im(x0). 
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Fig.  4.  The  reproduction  of  'P"  'P  for  a  Gaussian  wave  packet  with  harmonic  oscillation  by  100,000  random  trajectories  at  six 
time  instants  (a)  t  =  2,  (b)  t  =  4,  (c)  t  =  6,  (d)  t  =  8,  (e)  t  =  10,  and  (f)  t  =  12.  As  time  evolves,  the  statistical  distribution  of 
the  100,000  trajectories  converges  to  the  prediction  of  '!'*<!' .  (For  interpretation  of  the  references  to  colour  in  this  figure  legend, 
the  reader  is  referred  to  the  web  version  of  this  article.) 


A  total  of  100,000  values  of  Im(x0)  are  selected  according  to  the  distribution  of  f  and  100,000  de¬ 
terministic  complex  trajectories  are  computed  from  Eq.  (6.3).  The  point  set  {Re(x,)}  of  the  resulting 
100,000  trajectories  is  created  and  its  correlation  coefficient  rD(t)  with  respect  to  the  PDF  is 
computed  as  a  function  of  time  as  shown  in  Fig.  5.  Also  shown  in  Fig.  5  is  the  correlation  coefficient 
rR(t)  based  on  100,000  random  complex  trajectories  computed  from  Eq.  (5.3).  Although  all  the  ran¬ 
dom  trajectories  start  from  the  same  position  Re(x0)  =  Im(x0)  =  0,  the  addition  of  the  random 
variable  f  at  every  time  step  diversifies  the  trajectories  quickly  so  as  to  develop  a  statistical  pattern 
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Fig.  5.  Time  response  of  two  correlation  coefficients  for  a  coherent  state.  The  blue  line  is  the  correlation  coefficient  Tb(t) 
between  the  deterministic  complex  trajectories  solved  from  Eq.  (6.3)  and  the  PDF  <P*<P,  while  the  red  line  is  the  correlation 
coefficient  rR(t )  between  the  random  complex  trajectories  solved  from  Eq.  (5.3)  and  the  PDF  (For  interpretation  of  the 
references  to  colour  in  this  figure  legend,  the  reader  is  referred  to  the  web  version  of  this  article.) 

approaching  to  'I'*'!',  as  is  reflected  in  Fig.  5,  where  rR(t)  increases  monotonically  to  1  as  time  pro¬ 
ceeds.  Indeed,  the  choice  of  Re(x0)  is  arbitrary;  random  trajectories  solved  from  Eq.  (5.3)  are  allowed 
to  start  with  different  initial  positions  without  changing  the  property  of  convergence  rR(t )  1, 

as  t  ->  oo.  Especially,  if  Re(x0)  is  chosen  randomly  according  to  the  distribution  of  the  initial  PDF 
0,  x),  then  we  will  have  a  perfect  reconstruction  of  ip*\ P(t,  x)  by  the  point  set  {Re(Xj)}  at  any 
instant,  i.e.,  rR(t)  =  1,  Vt  >  0. 

On  the  other  hand,  the  correlation  coefficient  rD(t)  constructed  from  the  deterministic  model  has  a 
periodic  oscillation  as  shown  in  Fig.  5,  being  unable  to  reach  a  steady  value  at  1.  The  periodic  nature  of 
rD(t )  originates  from  the  deterministic  equation  (6.3),  whose  solutions  lose  diversity  in  the  beginning 
and  in  the  end  of  each  period  of  motion,  where  all  the  trajectories  start  from  or  converge  to  the 
same  point  Re(x0)  =  0.  In  summary,  the  deterministic  dynamics  given  by  Eq.  (3.19)  or  Eq.  (6.3)  is 
helpful  to  determine  mean  quantum  motion  and  to  estimate  the  boundary  of  quantum  uncertainty. 
As  long  as  the  actual  quantum  trajectory  is  concerned,  the  Wiener  process  dw  has  to  be  superposed 
on  the  deterministic  dynamics  to  form  the  stochastic  differential  equation  (5.1),  whose  solutions 
automatically  manifest  the  statistic  behavior  prescribed  by  the  PDF  <*'*<#',  no  matter  what  the  initial 
positions  are.  The  stochastic  differential  equation  (5.1),  which  is  derived  in  this  paper  by  the  approach 
of  stochastic  optimal  guidance  design,  serves  as  a  complex  representation  of  the  quantum  state  V, 
which  reveals  the  internal  mechanism  underlying  the  externally  observed  PDF  &*&_ 

7.  Conclusions 

In  this  paper,  de  Broglie’s  pilot  wave  idea  is  proved  and  realized  by  stochastic  optimal  guidance 
design  for  an  equivalent  guidance  problem  within  which  a  quantum  particle  is  regarded  as  the  flight 
vehicle  to  be  guided  and  its  accompanying  pilot  wave  is  the  guidance  law  to  be  designed  so  as  to  guide 
the  particle  to  a  random  target  driven  by  a  stochastic  Wiener  process,  while  minimizing  the  Lagrangian 
cost  function.  Several  significant  findings  have  been  revealed  by  this  innovative  interpretation  of 
quantum  mechanics. 

1.  The  Schrodinger  equation  is  just  the  stochastic  Hamilton-Jacobi-Bellman  (HJB)  equation  arising 
from  solving  the  proposed  optimal  control  problem  by  a  dynamic  programming  approach,  and  the 
optimal  guidance  law  (the  pilot  wave)  is  synthesized  by  the  wavefunction  ^(t,  x),  a  solution  to  the 
Schrodinger  equation. 
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2.  The  quantum  noise  is  engaged  to  the  guidance  system  via  a  complex  gain  *J—ih/m  so  that  the 
closed-loop  quantum  guidance  system  can  be  represented  by  a  complex  state-space  model  from 
which  quantum  operators  emerge  automatically. 

3.  A  quantum  displacement  contains  two  components.  The  deterministic  component  is  called  drift 
displacement  which  is  driven  by  the  optimal  guidance  command.  The  random  component  is  called 
diffusion  displacement  which  is  driven  by  a  Wiener  process.  The  former  characterizes  the  mean  and 
variance  of  quantum  trajectories,  while  the  latter  contributes  randomness  and  fractal  to  quantum 
trajectories. 

4.  The  optimal  guidance  approach  to  quantum  mechanics  yields  a  stochastic  differential  equation, 
from  which  quantum  trajectories  can  be  solved  and  visualized  with  any  given  resolution.  The 
statistical  distribution  of  the  resulting  quantum  trajectories  is  shown  to  consist  with  the  prediction 
of  the  probability  density  function  'I'*'!'.  In  other  words,  the  random  trajectories  solved  from  Eq. 
(5.1)  actually  give  an  explicit  characterization  of  the  quantum  paths  proposed  by  Feynman  in  his 
path  integral  approach  to  quantum  mechanics. 
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